# Lightweight Visual Question Answering

Moondream 2b 2025 04 14 4bit

Moondream is a lightweight vision-language model designed for efficient cross-platform deployment. The 4-bit quantized version, released on April 14, 2025, substantially reduces memory usage relative to the unquantized model while maintaining high accuracy.
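To give a rough sense of why 4-bit quantization reduces memory usage, the sketch below estimates the storage needed for the weights alone at two precisions. It assumes that "2b" in the model name means roughly 2 billion parameters and that the unquantized weights are stored at 16 bits each; both figures are assumptions for illustration, not published specifications. The helper `weight_memory_gib` is hypothetical, and the estimate ignores quantization scales, activations, and any runtime overhead.

```python
def weight_memory_gib(num_params: int, bits_per_weight: float) -> float:
    """Approximate memory needed to store the weights alone, in GiB."""
    return num_params * bits_per_weight / 8 / 2**30


# Assumption: "2b" in the model name means roughly 2 billion parameters.
ASSUMED_PARAMS = 2_000_000_000

# Assumption: the unquantized weights use 16 bits each.
fp16_gib = weight_memory_gib(ASSUMED_PARAMS, 16)
int4_gib = weight_memory_gib(ASSUMED_PARAMS, 4)

print(f"16-bit weights: {fp16_gib:.2f} GiB")
print(f" 4-bit weights: {int4_gib:.2f} GiB")
print(f"reduction factor: {fp16_gib / int4_gib:.1f}x")
```

Under these assumptions the weights shrink by a factor of four. Real quantized files are usually somewhat larger than this estimate, because some layers are often kept at higher precision and per-group scaling factors are stored alongside the weights.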

Dermatech Qwen2 VL 2B GGUF

A multimodal model based on the Qwen2 architecture. It supports text generation, image-to-text, and visual question answering tasks, and is available in multiple quantized versions.

Image-to-Text English

Qwen2 VL 2B Instruct GGUF

Qwen2-VL-2B-Instruct is a multimodal vision-language model with 2B parameters, based on the Qwen2 architecture, that supports image-text generation tasks.

Image-to-Text English

Tinyllava 1.1b V0.1

A lightweight visual question answering model based on TinyLlama-1.1B, trained using the BakLlava codebase, supporting image content understanding and question-answering tasks.
